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PageRank is arguably the most popular ranking algorithm which is being applied in real systems 
ranging from information to biological and infrastructure networks. Despite its outstanding popu¬ 
larity and broad use in different areas of science, the relation between the algorithm’s efficacy and 
properties of the network on which it acts has not yet been fully understood. We study here PageR- 
ank’s performance on a network model supported by real data, and show that realistic temporal 
effects make PageRank fail in individuating the most valuable nodes for a broad range of model 
parameters. Results on real data are in qualitative agreement with our model-based findings. This 
failure of PageRank reveals that the static approach to information filtering is inappropriate for a 
broad class of growing systems, and suggest that time-dependent algorithms that are based on the 
temporal linking patterns of these systems are needed to better rank the nodes. 


I. INTRODUCTION 

With the amount of available information constantly 
growing due to the widespread usage of computers and 
the Internet, network-driven information filtering tools 
such as ranking algorithms mia and recommender sys¬ 
tems [3] attract attention of researchers from various 
fields. PageRank, one of the most popular ranking al¬ 
gorithms, has been originally devised to rank web sites 
in search engine results [4]. The algorithm acts on uni- 
partite directed networks and builds on the circular idea 
node is important if it is pointed by other impor¬ 
tant nodes The essential role that PageRank plays in 
the Google search algorithm has stimulated extensive re¬ 
search of its properties [5] and relations to previous rank¬ 
ing techniques [6]. PageRank has been applied far be¬ 
yond its original scope: in ranking of scholarly papers [ 7 ], 
authors H E] and journals uni, ranking of images in 
search m, ranking of urban roads according to traffic 
flow [12] , measuring the importance of biochemical reac¬ 
tions in the metabolic network m, for example. The 
algorithm’s remarkable stability properties make 

it a suitable candidate to rank nodes in noisy networks 
such as the World Wide Web (WWW) and the protein 
interaction networks, where the information is often not 
completely reliable. Variants of PageRank include Eigen- 
trust which computes trust values in distributed peer-to- 
peer systems m, LeaderRank which computes influence 
of users in social networks m, and CiteRank which uses 
a model of citation network traffic to compute the impor¬ 
tance of scientific papers m, among others; variants of 
PageRank have been also applied to bipartite networks 
[THHin] and multilayer networks m- 

The widespread usage of PageRank motivates us to 
ask: when is the algorithm effective in ranking nodes ac¬ 
cording to their quality? Are there circumstances under 
which the algorithm is doomed to fail? Answering these 
questions is of primary importance to foster our under¬ 
standing of the ranking algorithm, which is a problem 
of practical significance given the influence of ranking- 
based tools such as search engines and recommendation 
systems on many aspects of our society, from marketing 


to politics [ 22 H 25 ] . While previous research has already 
studied the rankings produced by PageRank for different 
topological properties of the input networks m, the eval¬ 
uation of the algorithm on networks that evolve in time 
remains a largely unexplored field. The main aim of this 
work is to fill this gap and demonstrate the shortcomings 
of the algorithm when applied to growing networks ex¬ 
hibiting temporal effects. To this end, we use a growing 
directed network model with preferential attachment and 
relevance [26] which generalizes the classical preferential 
attachment introduced in EH- This model (hereafter the 
Relevance Model, RM) has been shown by maximum like¬ 
lihood analysis to be the preferential attachment model 
that best explains the linking patterns in real information 
systems [28] and has been used to model real information 
systems, such as the WWW [29], citation networks [30] . 
online networks [28], and even technological networks, 
such as the network of Internet autonomous systems m- 
In the RM, three essential elements rule the compe¬ 
tition among nodes for incoming links: preferential at¬ 
tachment, fitness and temporal decay. Preferential at¬ 
tachment is a well-established mechanism that has been 
observed in a wide range of real systems (see [32] [33] for a 
review). Fitness is a quality parameter assigned to each 
node that quantifies the node’s inherent competence in 
attracting new incoming links [34] ; all other things being 
equal, in a competitive environment high-fltness nodes 
are suitable for success in the system and are likely to 
become eventually popular, whereas low fitness nodes 
tend to remain little known [29]. Node fitness is mod¬ 
ulated with a time-decaying function which gives rise to 
the so-called node relevance [26]: a node of high-fltness 
thus initially has high relevance and potentially attracts 
many links but this relevance eventually vanishes and the 
node ceases to attract new links. Fitness and relevance 
discount all system-dependent intangible and subjective 
factors that determine node’s quality, quantify how much 
a node is attractive to a given system and can be esti¬ 
mated on real data by different techniques [26] [28H3Q] . In 
our model, each node is further endowed with an activity 
parameter which represents the rate at which the node 
creates new outgoing links; activity too is modulated 
with time. We use the model to produce artificial data 


2 


and compare the ranking of nodes by their indegree (i.e., 
the number of incoming links) and PageRank score with 
the node ranking by their fitness values. We find that 
when model parameters for the temporal decay of rele¬ 
vance and activity substantially differ from each other, 
the redistribution of PageRank scores is biased towards 
old or recent nodes, respectively (depending on which de¬ 
cay is faster). In addition, when PageRank is temporally 
biased in either way, indegree markedly outperforms it 
in ranking nodes by their fitness. These results are con¬ 
firmed on a modified model, so-called Extended Fitness 
Model, where high-fit ness nodes preferentially attach to 
other high-fitness nodes, whereas low-fitness nodes pref¬ 
erentially attach to popular nodes. While in this model 
PageRank can significantly outperform indegree in re¬ 
producing the ranking of nodes by their fitness for some 
model parameters, extensive parameter regions where the 
algorithm fails and performs worse than indegree are still 
present. 

We finally apply PageRank on two real datasets, the 
social network of Digg.com users and the network of ci¬ 
tations between American Physical Society (APS) scien¬ 
tific articles, and compare the rankings of nodes by their 
indegree and PageRank score with the node ranking by 
their total relevance which is a real-data estimate for fit¬ 
ness. We find that while PageRank score is highly cor¬ 
related with indegree in social network data and the two 
metrics have similar performance, PageRank is markedly 
outperformed by indegree in citation data. These find¬ 
ings strongly discourage the use of PageRank in systems 
where strong temporal patterns exist, like citation net¬ 
works. 


II. RESULTS 
A. Relevance Model (RM) 

In the RM, when a node j creates a new link at time t, 
the probability 11 •’^(t) that it chooses node i as the target 
is assumed to be 

+ ( 1 ) 

where kY^{t) is the current indegree of node i, 77 ^ is its fit¬ 
ness and /r is a function of the node’s age (r^ is the 
time at which node i enters the system). The prod¬ 
uct r]i fR{t — Ti) := Ri{t) represents the relevance of 
node i at time t |26l [28]. We assume that /i?(t) de¬ 
cays monotonously and thus mimics real situations where 
nodes lose relevance over time. Previous studies of the 
RM [26l [30] have focused on scientific citation networks 
which are tree-like because nodes create outgoing links 
only in the moment when they enter the system - the 
links are thus always directed back in time. We consider 
a general situation where nodes continue being active, 
create outgoing links continually, and the resulting net¬ 
work thus contains loops which are common in many real 


systems, such as the WWW, for example. We use the ac¬ 
tivity potential approach introduced in [35] and assign to 
each node i an activity parameter A^. At each simulation 
step, a new node is created and connected to an existing 
node. In addition, m = 10 existing nodes are sequen¬ 
tially chosen and create one link each (see the Methods 
section for all simulation details). The m nodes that are 
active at time t are chosen with the probability 

( 2 ) 

where /a(^) is a monotonously decaying function of time. 
A broad distribution of the activity parameter A allows 
us to reproduce broad outdegree distributions typically 
found in real networks [33] without resorting to preferen¬ 
tial linking mechanisms for outgoing links. 


B. Decay of empirical relevance and activity in real 

data 

We now analyze real data to validate the hypothesis of 
relevance and activity decay. We refrain from maximum 
likelihood analysis [28] because of its computational com¬ 
plexity. Instead, we follow a simpler procedure: following 
[26] . we define the empirical relevance r^(t) of node i at 
time t as 


ri(t) 


ni(t) 


( 3 ) 


Here n^(t) = At)/L(t, At) is the ratio between 

the number A/cJ’^(t, At) of incoming links received by 
node i in a suitable time window [t,t + At] and the 
total number I/(t, At) of links created within the same 
time window, whereas nf^(t) = ^J^(t)/ 5 ^^- is the 

expected value of ni{t) according to preferential attach¬ 
ment alone. Empirical relevance ri{t) larger or smaller 
than one means that node i at time t outperforms or 
underperforms, respectively, with respect to its prefer¬ 
ential attachment weight in the competition for 

incoming links. 

The hypothesis of time-dependent and heterogeneous 
relevance has already been validated in the APS scientific 
citation network [26]. Here we further analyze the APS 
dataset, described in the Methods section, finding (Fig. 
S 2 ) that the decay of relevance is well reproduced by a 
power law function (see the Supplementary Note S 2 for 
detailed results). Moreover, we validate the hypothesis 
of relevance and activity time decay in a very different 
system, the Digg.com social network of users, where a di¬ 
rected link between two users means that one user follows 
the other (see the Methods section for the description of 
the dataset). We find (Fig.SI) that relevance decays also 
in this dataset. Based on [35], we define the empirical 
activity ai{t) of node i at time t as the ratio between the 
number of outgoing links created by node i in a suitable 
time window [t, t -h At] and the total number of links cre¬ 
ated within the same time window. We find (Fig. SI) 
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FIG. 1. Illustration of a network produced with the RM for fast aging of relevance and constant activity (color 
online). In each step, a new node is introduced and connected to an existing node (arcs above the row of nodes). In addition, 
a randomly chosen node becomes active and connects to an existing node (arcs below). The target node is chosen by Eq. 0 
in both cases (see Supplementary Note S5 for model parameters). The orange and red part of the each link mark the initial 
and target node, respectively. Note that while old nodes point to nodes of every age thanks to constant activity, recent nodes 
never point to the old nodes due to the decay of relevance. This asymmetry results in PageRank scores biased towards recent 
nodes. 



FIG. 2. PageRank time bias (color online). We show here 
the average entrance time r of the top 1% nodes of the node 
ranking by indegree and PageRank, respectively, as a function 
of the relevance decay parameter Or. Networks of iV = 10000 
nodes are grown with the RM with slow decay of activity 
{Oa = N). Two limits of PageRank bias are visible: (1) When 
the decay of relevance is fast {Or Oa), a large number of 
top nodes are recent as a consequence of the network structure 
demonstrated in Fig. (2) When the decay of relevance is 
slow {Or ~ N), top nodes are old because the old nodes can 
be pointed by nodes of every age. While the latter bias is 
common to PageRank and indegree, the former bias is specific 
to PageRank because of its network nature. 


that also activity decays with time, and activity decay is 
slower than relevance decay (see Supplementary Note SI 
for details). 


C. Results of numerical simulation with the RM 

For the sake of generality, we consider both exponen¬ 
tial and power-law decay functions /i?(t) = exp {—t/OR), 
fA{t) = exp{-t/eA) and /^(t) = /^(i) = 

respectively. Our main goal now is to study the depen¬ 
dence of PageRank performance on model parameters 
0r,0a and aR^aA, respectively. We refer to the Meth¬ 


ods section for the mathematical definition of PageRank 
and details about the choice of fitness and activity dis¬ 
tributions in simulations. 

A good ranking algorithm is expected to produce an 
unbiased ranking where both recent and old nodes have 
the same chance to appear at the top. In growing net¬ 
works with temporal effects, PageRank can fail to achieve 
this. To explain the origin of this failure, we consider 
two extreme situations: relevance decay which is very 
fast and slow, respectively, with respect to activity de¬ 
cay. When relevance decay is slow (or absent, as in 
the original fitness model [34]), recent nodes receive few 
links because their weight in preferential attachment is 
much smaller than the weight of all nodes that have al¬ 
ready accumulated many links (this manifests itself in the 
network’s strong dependence on the initial configuration 
m)- PageRank as well as indegree are therefore strongly 
biased towards old nodes. When relevance decay is fast, 
preferential attachment is compensated by a quick decay 
of relevance and therefore recent nodes can reach high 
indegree. However, there is now an essential asymmetry 
in the system which relates to outgoing links: while re¬ 
cent nodes mostly point to other recent nodes because 
of relevance decay, old nodes point to nodes of every age 
because they remain active during the whole system’s 
lifetime (see Fig. [^for an illustration). PageRank is con¬ 
sequently biased towards recent nodes: while a random 
surfer at an old node is likely to jump to a recent node, 
the converse is not true; recent nodes effectively act as 
an attractor. 

Fig. [2] shows a transition between the two extreme 
cases for artificial data produced by the RM with expo¬ 
nential relevance decay and exponentially distributed fit¬ 
ness. When the decay of relevance is slow {Or = 10000), 
there are only old nodes at the top 1% positions of the 
rankings by PageRank score and indegree. When the de¬ 
cay of relevance is fast {Or = 10), recent nodes occupy 
the majority of the top 1% positions in the ranking by 
PageRank score. By contrast, the ranking by indegree is 
essentially unbiased in this limit as the average entrance 
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time r of the top-1% nodes is close to N/2 = 5000 which 
corresponds to the absence of time bias. 

We discuss now the implication of PageRank’s time 
bias on the algorithm’s ability to rank nodes by fitness. 
In the following, we denote by r(p, r^) the Pearson’s cor¬ 
relation between the PageRank scores p and the fitness 
values ? 7 , and we denote by the Pearson’s corre¬ 

lation between node indegree and fitness. Fig. shows 
the performance ratio r{p,r]),r]) in the {Or^Oa) 
plane. Since r{p,rj)/r{k'^^,r]) < 1 everywhere, we find 
that PageRank yields no improvement with respect to in¬ 
degree in ranking nodes by fitness. This is because while 
the PageRank algorithm assumes that important nodes 
point to other important nodes, this feature is absent in 
the RM where all nodes are driven by the same mech¬ 
anism, Eq. when choosing their connections. As a 
result, PageRank does best in comparison with indegree 
along the Or — Oa diagonal where PageRank is not tempo¬ 
rally biased and r{p^r])/r{k'^^^rj) becomes close to, albeit 
always strictly lower than, one. When moving away from 
this diagonal, PageRank score has temporal bias towards 
recent or old nodes (Fig. S 6 ), its correlation with inde¬ 
gree (Fig. S7) and fitness (Fig. S 8 ) decrease, and it 
reproduces fitness substantially worse than indegree (red 
areas in Fig. [^. Qualitatively similar behavior is found 
for the RM with uniformly distributed fitness (Fig. S9), 
power-law decay of relevance and activity (Fig. SIO), ac¬ 
celerated growth rate (m(t) oc t instead of m{t) = 10 , 
Fig. S 12 ). The same is true when the ranking quality 
is measured by the precision metric Pioo(',^), (defined 
as the number of fitness top -100 nodes placed in the top 
100 of the ranking produced by an algorithm), instead of 
the linear correlation coefficient (Fig. Sll). This shows 
that our findings are robust and do not require a specific 
model setting. 


D. An extended model based on fitness 



relevance decay 6 r 


FIG. 3. A comparison of performance of PageR¬ 
ank and indegree in the RM data (N=10,000, p{r]) = 
exp (—77), color online). The heatmap shows the ratio 
r{p,rj)/r{k^^ , 7 ]). The black dotted line represents the con¬ 
tour along which PageRank is not temporally biased (see Fig. 
S6, left). The upward bending of this contour is a finite-size 
effect. 



To demonstrate that PageRank’s under-performance 
with respect to indegree is a general feature, we now pro¬ 
ceed to a different model for artificial data which is more 
compatible with PageRank’s basic idea that a node is im¬ 
portant if it is pointed by other important nodes. In this 
model (hereafter Extended Fitness Model, EFM), high- 
and low-fitness nodes differ not only in their ability to at¬ 
tract new incoming links, but also in their sensitivity to 
the fitness of the other nodes when choosing their outgo¬ 
ing connections. High-fit ness nodes are highly attractive 
to new incoming links as well as highly sensitive to fitness 
of the others when choosing their outgoing connections. 
Low-fitness nodes are basically insensitive to fitness and 
choose their target nodes mostly by current popularity 
amended by aging. High-fitness nodes are then more 
likely to be pointed by other high-fitness nodes than low¬ 
fitness nodes (see Fig. S5) which agrees with the basic 
premise of PageRank: important nodes are pointed by 
other important nodes. We therefore expect PageRank 


FIG. 4. A comparison of PageRank and indegree cor¬ 
relation with fitness in the EFM data {N = 10,000, 
H = 250, color online). The heatmap shows the ratio 
r{p,q)/r{k^^,q). The white dotted line represents the con¬ 
tour where PageRank is not temporally biased (see Fig. S6, 
right). 

to outperform indegree in ranking the nodes by fitness. 
The model assumes that the probability H^^ that a link 
created by node j at time t ends in node i has the form 

n*” (i) - {krit) +vT fR{t - n) (4) 

where node fitness 77 is now constrained to the range [ 0 , 1 ] 
to prevent a negative exponent 1 — 77 in the first term. 
We stress that the probability H*’^ depends not only on 
the fitness pi of the target node, but also on the fitness 
Pj of the node j that creates the outgoing link, which is 
a new element with respect to the RM. A similar model 
has been used to model user-item networks in [37]. We 
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FIG. 5. A comparison of PageRank and indegree cor¬ 
relation with total relevance in real data and in cal¬ 
ibrated simulations with the RM. PageRank is outper¬ 
formed by indegree in both datasets (and in the corresponding 
calibrated simulations). In the Digg.com social network, the 
fitted relevance and activity power-law decay exponents are 
not far from the parameter region where PageRank is maxi¬ 
mally correlated with indegree in numerical simulations with 
the RM with power-law decay (see Fig. SIO), and PageR- 
ank’s and indegree’s correlation with total relevance are close 
to each other. By contrast, in the APS dataset activity decays 
immediately, whereas relevance decays progressively (see Fig. 
S2); as a consequence, PageRank is strongly biased towards 
old nodes (see Fig. S3) and is outperformed by indegree by a 
factor 2.58 [r(p, T) = 0.19 whereas r{k'^^,T) = 0.49]. We refer 
to the Supplementary Note S3 for details about the simula¬ 
tion calibration on real data and to the Supplementary Note 
S4 for details on the computation of empirical relevance in 
real and artificial data. 


assume that a small number H of nodes have high fitness 
{t] G [10“^, 1]) and the remaining N — H nodes have low 
fitness {rj G [0,10“^), see the Method section for details). 

Fig. a shows the results obtained with the EFM. The 
correlation coefficient r{p,k'^^) (Fig. S7, right) and the 
average age of top 1% nodes (Fig. S6, right) have qualita¬ 
tively the same behavior as for the RM which indicates 
that the behaviour of these quantities as a function of 
model’s temporal parameters is universal and indepen¬ 
dent of the exact growth rule. The model is favorable 
to PageRank and indeed, the algorithm now can signif¬ 
icantly outperform indegree in terms of the correlation 
between fitness and node score when PageRank is not 
temporally biased (blue area in Fig. [^. Nevertheless, 
PageRank still underperforms indegree in two extensive 
regions of the parameter plane {Or^Oa)- As for the RM, 
these two regions correspond to the cases where activ¬ 
ity and relevance decay timescales substantially differ. 
These results are again confirmed by using power-law 
aging instead of exponential (Fig. SIO) and the preci¬ 
sion metrics instead of the correlation coefficient (Fig. 
Sll). Note that we introduced here the EFM to show 
that PageRank’s bias occurs also in a setting favorable to 
the algorithm; while it seems plausible that some nodes 
are more sensitive to fitness than others when making 
connections, we leave real data validation of the EFM 
for future research. 


E. Comparing indegree and PageRank: results in 
real networks 

Algorithm evaluation in real data is made difficult by 
several factors. In general, it is impossible to objectively 
evaluate node importance in a system because it depends 
on many intangible and subjective elements [6]. To as¬ 
sess the performance of ranking algorithms on real data, 
we compare node score with total relevance Ti = Vi (t) 
which is an estimate of node fitness (see Ref. [26] and 
the Supplementary Note S4). Results on real data and 
the corresponding calibrated simulations with the RM 
are reported in Fig. Our calibration procedure for 
simulations focuses on temporal decay of relevance and 
activity and is described in detail in the Supplementary 
Note S3; more accurate calibration is possible but goes 
beyond the scope of our work. Uncertainty of these 
results estimated by sample-to-sample fluctuations and 
non-parametric bootstrap [38] for model and real data, 
respectively, is of the order of 10“^ which is negligi¬ 
ble in comparison with the observed differences between 
PageRank and indegree (see Supplementary Note S6). 

In the Digg.com social network, the empirircal rele¬ 
vance and activity power-law decay exponents are not 
far from the parameter region where PageRank scores 
are maximally correlated with indegree in the simula¬ 
tions with the RM with power-law decay (see Fig. SIO), 
which is in qualitative agreement with the observed high 
value of correlation between PageRank and indegree in 
the dataset {r{k'^^,p) = 0.88); PageRank is outperformed 
by indegree in ranking nodes by their total relevance but 
the performances of the two metrics are relatively close 
to each other (see Fig. [^. 

In citation data, where the use of PageRank and other 
algorithms inspired by PageRank has been much stud¬ 
ied m nil ES], activity and relevance decays necessar¬ 
ily mismatch: relevance progressively decays with time 
[26] . whereas activity decays immediately. In the APS 
dataset we find that PageRank is significantly biased to¬ 
wards old nodes (Fig. S3): this is because old papers can 
be pointed by papers of every age, while recent papers 
are pointed only by recent papers. This is the opposite 
time bias than that depicted in Fig. Moreover, we 
find that PageRank and indegree are weakly correlated 
[r{k'^^^p) = 0.52], and indegree is remarkably better cor¬ 
related with total relevance than PageRank (see Fig. [^. 
These findings are consistent with the outcomes of a cal¬ 
ibrated numerical simulation with the RM (see Fig. [^, 
where all outgoing links of a node are created when the 
node enters the system and the outdegree distribution 
is exponential as in the APS dataset (Fig. S4), see the 
Supplementary Note S3 for details about simulations cal¬ 
ibrated on real data). Note that the age distribution of 
top nodes in indegree and PageRank ranking in the APS 
real network and the artificial network generated by the 
corresponding calibrated simulation have the same quali¬ 
tative shape (Fig. S3). This confirms that our simulation 
calibration on real data, which is based only on the tern- 
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poral patterns of the system, qualitatively captures the 
temporal bias of PageRank. 

We conclude this paragraph with a consideration on to¬ 
tal relevance T. Motivated by the high correlation r(T, r]) 
between node total relevance and fitness found in the cal¬ 
ibrated simulations (see Supplementary Note S4), in this 
work we use total relevance T as a proxy for node fit¬ 
ness in the real data. In the RM, we find that node 
total relevance outperforms indegree and PageRank in 
ranking nodes by fit ness for a broad range of model pa¬ 
rameters (Fig. S13). By contrast, the parameter region 
where total relevance outperforms indegree and PageR¬ 
ank is smaller in data produced with the EFM (Fig. |S14[ ). 
We leave for future research detailed investigation of how 
the performance of total relevance in ranking nodes by 
fitness depends on the assumptions and parameters of 
the underlying model. These findings might also moti¬ 
vate future study of the ranking of nodes by their total 
relevance in real data that are well-described by the RM. 


III. DISCUSSION 

To summarize, our numerical simulations indicate that 
the mismatch between the timescales of relevance and ac¬ 
tivity decay makes PageRank scores biased towards re¬ 
cent nodes (when the decay of relevance is faster) or old 
nodes (when the decay of activity is faster). This tempo¬ 
ral bias reduces PageRank’s capability to rank nodes by 
fitness and causes it to underperform in comparison with 
the elementary ranking of nodes by indegree in the RM 
which is to our best knowledge the most accurate model 
for describing growing information networks [28l|30]. Our 
findings are robust with respect to changes in the func¬ 
tional form of the time-decay function, in the distribu¬ 
tion of fitness among the nodes, and in the metric used 
to evaluate the ability of an algorithm to rank nodes by 
their fitness. We also studied a model (the EFM) that 
provides a favorable setting for PageRank performance; 
PageRank can outperform indegree on the data produced 
by this model, but fails again when the two timescales 
mismatch. Moreover, we find indications of the influence 
of temporal patterns on PageRank’s performance also in 
real data. In citation data, PageRank is excessively bi¬ 
ased towards old nodes and, as a consequence, is clearly 
outperformed by indegree in ranking nodes by their total 
relevance which is an estimate of node fitness (see Ref. 
m and the Supplementary Note S4). By contrast, inde¬ 
gree and PageRank perform similarly in social network 
data where there is not a sharp mismatch between ac¬ 
tivity and relevance timescales. The results of real data 
analysis are in agreement with our model-based finding 
that PageRank can only perform well if the two system’s 
timescales (of activity and relevance decay, respectively) 
are of similar magnitude. 

The methods developed and used in this article are 
general and can be applied to any growing directed net¬ 
work where nodes compete for incoming links and where 


preferential attachment and temporal effects influence 
the linking patterns, which includes a wide class of real 
networks. To diagnose whether a growing directed net¬ 
work is or not suitable for the application of PageRank, 
one can fit the empirical relevance decay and activity de¬ 
cay timescales on the data and run a corresponding cali¬ 
brated simulation which reveals whether PageRank is or 
not able to rank nodes according to their fitness. We have 
not attempted to study how our findings are affected by 
further real-world phenomena, such as link deletion [29] , 
popularity [40| and activity bursts im, among others. 
Link time stamps are crucial for our analysis; in all the 
datasets where they are not reported, we cannot com¬ 
pute neither node relevance r{t) nor node activity a{t) 
which exclude these systems from the range of applica¬ 
bility of our analysis. We also stress that the frame¬ 
work introduced in this work is not applicable to undi¬ 
rected networks, such as collaboration networks, scholar 
co-citation networks and road networks, among others. 
In undirected networks indeed there is no distinction be¬ 
tween incoming and outgoing links and, as a consequence, 
relevance and activity cannot be defined as two separate 
node properties. The model-based evaluation of PageR¬ 
ank’s performance in networks without time information 
and undirected networks is certainly an interesting and 
largely unexplored problem but goes beyond the scope of 
this work. 

The shortcoming of PageRank due to temporal effects 
is particularly worrying for applications of the algorithm 
to scientific citation data nni Eg sa. While PageRank 
can And old valuable papers underestimated by indegree 
[ 7 ], the algorithm is biased towards old nodes and as a 
consequence is outperformed by indegree in ranking pa¬ 
pers by importance, which strongly discourage the use of 
the algorithm to rank scientific papers. In this context, 
the need for including temporal effects in the algorithm 
has already been stressed in [n 1 ^ : the model-based 
approach introduced in this article leads us to the same 
conclusion. How to best include the temporal dimension 
in ranking scientific publications remains an open issue. 
One could consider a self-consistent algorithm that takes 
time into account, such as CiteRank m , or resort to 
fitness estimates, such as total relevance or maximum 
likelihood estimates [28]; the model-based approach in¬ 
troduced in this article provides a simple yet effective 
method - the comparison of node scores with intrinsic 
fitness in calibrated simulations - which could be used 
to establish which algorithm is more suitable for a given 
system. Our findings also bring new insights into the 
study of the relation between node indegree and PageR¬ 
ank. Previous studies [25] [43] established a linear rela¬ 
tionship between node degree and the average PageRank 
score for uncorrelated networks, and considered any de¬ 
viations from this behavior as fluctuations. We And that 
for a broad range of network parameters, much of these 
apparent fluctuations are in fact trends caused by the in¬ 
terplay between the network’s temporal features and the 
PageRank algorithm. 
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Our model-based evaluation of ranking algorithm is 
applicable also to the WWW. There is general agree¬ 
ment in recognizing the importance of PageRank in the 
success of Google’s search engine Em, yet it remains 
unclear which properties make the Web a suitable net¬ 
work where to apply the algorithm. While Ref. m em¬ 
phasizes the role of the scale-free topology of the Web on 
PageRank’s success, our findings stress the importance of 
temporal patterns in determining the success or failure of 
PageRank. Further data analysis on Web data could re¬ 
veal whether relevance and activity decay timescales are 
of similar magnitude in the WWW which would imply 
maximal correlation between PageRank score and node 
fitness, and thus provide a further explanation of PageR¬ 
ank’s success in this system. 

In conclusion, PageRank, despite its popularity and ro¬ 
bustness, can fail and thus it should not be used without 
carefully considering the temporal properties of the sys¬ 
tem to which it is to be applied. The connection between 
PageRank’s failure and the temporal features of the an¬ 
alyzed networks indicates that the main reason for the 
reported failure is the static nature of the algorithm. We 
believe that a well-grounded ranking algorithm should be 
built on the temporal patterns of the system where it is 
intended to be applied and the dependence of its perfor¬ 
mance on system features should be exhaustively stud¬ 
ied in model data where system’s structural and tempo¬ 
ral properties can be modified simply by changing model 
parameters. We believe that the model-based theoret¬ 
ical evaluation of ranking algorithms developed in this 
work will open the door to systematic performance eval¬ 
uation of algorithms in evolving systems, deepen our un¬ 
derstanding of their limitations, and lead to the intro¬ 
duction of new improved algorithms. 


IV. METHODS 


Digg.com dataset. Digg.com had been an online so¬ 
cial news aggregator from December 2004 to July 2012. 
Digg.com users were allowed to submit and vote (“digg”) 
stories. Interaction between users took place through 
comments and messages (see [45] for a detailed descrip¬ 
tion of the website). We studied the social network of 
users where nodes represent the users and a link from 
node i to node j means that user i is a follower of user j. 
The complete dataset in our possession covers the period 
from 06/08/2005 to 08/07/2009. We analyzed a 3-years 
subset running from 01/01/2006 to 31/12/2008. The 
subset consists of N = 190, 553 nodes and T = 1, 552, 905 
links. 

APS dataset. The APS (American Physical Society) 
dataset in our possession spans from year 1893 until 
2009 and contains N = 450,056 nodes (papers) and 
L = 4,690, 967 directed links (citations) between them. 
This dataset has been used in [26| to validate the hypoth¬ 
esis of heterogeneous and decaying relevance. 


PageRank. In a directed monopartite network com¬ 
posed of N nodes where no dangling nodes (nodes with 
zero outdegree) exist, the vector of PageRank scores {pi} 
can be found as the stationary solution of the following 
set of recursive linear equations 


Jt) 

Pi / J 0'^ j^out 


1 — C 

N 


(5) 


where A is the network’s adjacency matrix {Aji is one if 
node j points to node i and zero otherwise), is the 
outdegree of node j, c is the teleportation parameter, 
and t is the iteration number [4] . Eq. ^ represents the 
master equation of a diffusion process on the network, 
which converges to a unique stationary state indepen¬ 
dently of the initial condition [46]. The PageRank score 
Pi of node i can be interpreted as the average fraction 
of time spent on node i by a random walker who with 
probability c follows the network’s links and with proba¬ 
bility 1 — c teleports to a random node. We set c = 0.85 
which is the usual choice in practice [46]. Iterations are 
stopped when the modulus distance between the vectors 
of scores at two consecutive iterations becomes smaller 
than e = 10“^ [46] . 

Simulation details. We use the artificial models (RM 
and EFM) to build monopartite directed networks com¬ 
posed of V = 10, 000 nodes. We start from a configura¬ 
tion with two nodes, node 0 and node 1, and a link from 
node 1 to node 0. At each simulation step t, we add a 
new node t to the system and connect it to an already 
existing node. The target node is chosen according to 
the attachment rule 0 (RM) or @ (EFM). If t > 10, 
we also sequentially add m = 10 links between the ex¬ 
isting nodes. Their initial nodes are chosen according to 
the activity rule 0 ; the target nodes follow again Eqs. 
0 or 0 , respectively. The creation of multiple links 
between a pair of nodes and self-loops are prohibited. 
Unless stated otherwise, results are averages over 6 real¬ 
izations of the model. Error bars In Eig. [^represent the 
standard error of the mean which is generally small. The 
same is true for Figures [^ and [^ where only the average 
values are displayed. 

Fitness and activity distributions in the RM. When 
relevance decay is sufficiently fast to allow the normaliza¬ 
tion factor kl(t) of to converge within the simulation 
time scale [26], the average final indegree of a node in 
the RM depends exponentially on node fitness. Conse¬ 
quently, different fitness distributions yield different in¬ 
degree distributions [26] [34]. We use both exponential 
and uniform fitness distribution in our simulations; re¬ 
sults for the latter are shown in Fig. S9. The outdegree 
distribution is only determined by the activity distribu¬ 
tion p{A) (see Supplementary Note S7 for basic analyt¬ 
ical results). In our simulations we use p{A) = 2 A~^ 
for A G [1, oo) everywhere except for the calibrated APS 
data simulation where all outgoing links of a node are 
created when the node enters the system and we use 
p{k) = 8.33 exp (—0.12 k) for k G [0, oc) as the outdegree 




distribution, as found in the APS data (see Fig. S4). 

Fitness and activity distribution in the EFM. We 
choose here a fitness distribution that aims to emphasize 
the difference between the linking pattern of high- and 
low- fitness nodes without trying to reproduce structural 
features of real data. The set of fitness values consists 
oi N — H equidistant values within the interval [0,77t/i) 
(low-fitness nodes) and H equidistant values from the 
range [rjth^ 1] (high-fitness nodes). These values are then 
bijectively assigned to the network’s N nodes at random. 


We set a small value of the threshold r]th = 10“^ which 
implies that the low-fitness nodes are essentially insen¬ 
sitive to node fitness, while the high-fitness nodes range 
from little fitness-sensitive nodes to nodes almost unaf¬ 
fected by popularity and mainly driven by fitness (when 
gj = 1, we have 11^’^ ^ qi /^{t — r^)). We run simulations 
with H = 250 = A^/40 for Fig. this value is small 
in order to amplify the advantage of high-fitness nodes 
in connecting to other high-fitness nodes (see Fig. S5). 
As in the RM, we use p{A) = 2A~^ for A e [l,oo) to 
generate the node activity values. 
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Relevance decay 
Node group C a D 

> 100 61.31 1.09 0.50 

G [10,100] 25.00 0.99 0.46 

< 10_1.85 0.76 0.12 

Activity decay 

Node group C a D 

kOUt ^ ^QQ ^ J . ^Q-3 Q 33 _g Q . ^Q-5 

G [10,100] 2.2 • 10“^ 0.44 -1.86 • 10“^ 

^out ^ . ^q-5 q 44 33 . ^q-6 


TABLE I. Parameter estimation results for the average rele¬ 
vance and activity decay in the Digg.com data. 


SUPPLEMENTARY NOTES 


SI: Analysis of empirical relevance and activity in the 
Digg.com dataset. As explained in the main text, the 
empirical relevance r^(t) of node i at time t is defined as 


ri{t) = 


ni{t) 


(S 6 ) 


where ni{t) = Akl'^{t, At)/L{t^ At) is the ratio between 
the number Akl^{t, At) of incoming links received by 
node i in a suitably chosen time window [t, t + At) and 
the total number L(t^ At) of links created within the same 
time window, whereas nf^{t) = kl'^{t)/^j kf^{t) is the 
expected value of ni{t) according to preferential attach¬ 
ment alone m- The activity ai{t) of node i at time t 
has been defined in [35] as 


Akf^Dt.At) 

“•« = At. A.) 


(S7) 


where At) is the number of outgoing links cre¬ 

ated by node i in the time window [t,t -h At). We use 
At = 1 week to compute relevance and activity in the 
Digg.com dataset. 

Figure shows the average temporal decay of rele¬ 
vance and activity in the Digg.com dataset (see MM sec¬ 
tion of the main text for a description of the dataset). 
We fit both decay profiles with a power-law function 
f{t) = C t~^ + D with three parameters (C, a, D) 
using the least-squares method. While other and per¬ 
haps more accurate patameter estimation procedures ex¬ 
ist, the present results are sufficient for our analysis. We 
only consider the first 100 weeks after the first link re¬ 
ceived/created by the respective node. Parameter esti¬ 
mates are summarized in Table |T| separately for nodes of 
high, medium and low in-degree and out-degree, respec¬ 
tively. One may note here that activity decays slower 
(with a lower exponent) than relevance. 

S2: Analysis of empirical relevance in the APS dataset. 
We use At = 91 days to calculate empirical relevance of 
nodes in the APS dataset (see MM section of the main 
text for a description of the dataset). Fignreshows the 


Relevance decay 


Node group 

C a 

D 

> 100 

217.5 1.40 0.19 

fc"’" G [10,100] 250.8 1.51 

0 

< 10 

177.2 1.51 

0 


TABLE If. Parameter estimation results for the average rele¬ 
vance decay in the APS data. 


average relevance decay in the APS dataset; it is analo¬ 
gous to Figure 1 in [26]. Similarly as for the Digg.com 
dataset, we fit the results with the power law dependence 
f{t) = Ct~^ + D. To avoid the non-monotonous initial 
behavior of relevance (which is due to, for example, the 
time needed to carry out and publish research building 
on a given paper), we ignore the first 5 years (10 years 
for low indegree nodes) after publication. The estimation 
results are reported in Table [TT| 


S3: Simulations calibrated on real data. When cali¬ 
brating the numerical simulations on the Digg.com and 
APS datasets, we focus only on the datasets’ temporal 
patterns that constitute the main motivation of our study 
and are the principal reason for the reported failure of 
PageRank. While more accurate calibration of models 
to the real data is possible, we do not find it necessary 
because our calibrated simulations capture some basic 
temporal patterns of indegree and PageRank scores (see 
Figure!^. 

The artificial dataset calibrated on the Digg.com data 
is grown using the relevance model (RM) with p{r]) = 
exp(— 77 ) and power-law decay of relevance = 1 ) and 
activity (a a = 0.4); the other simulations details are the 
same described in the MM section of the main text. The 
artificial dataset calibrated on the APS data is grown 
using the RM with p{r]) = exp(— 77 ) and power law decay 
of relevance {aR = 1.4); all outgoing links of a node 
are created when the node enters the system and the 
outdegree distribution is p{k) = 8.33 exp(—0.12/c) as in 
the APS data (see Figure S4). 


S4: Measuring empirical relevance in real and artificial 
data. Since fitness values are not known in real data, we 
use the total relevance defined by Eq. (§ as an estimator 
of node fitness. As shown for the RM in [26], total rele¬ 
vance and fitness are closely connected and both provide 
information about the perceived importance of a node. 
However, the direct use of Eq. poses a problem in 
artificial data. In our numerical simulations, a constant 
number of link are added to the system at each time 
step. The factor k'p{t) on the right side of Eq. (§ 
consequently grows linearly with simulation time and, as 
a result, the total relevance computed with Eq. (§ is 
biased towards recent nodes. This issue does not occur 
in real data where both I/(t, At) and kf^ grow with 
time. To avoid this bias, we omit the factor kf^ when 
computing relevance in real data and use the following 
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definition 




ni{t] At) 
At)' 


(S 8 ) 


The corresponding definition of total relevance for model 
data is Ti{t; At) := At); we use At = 20. This 

quantity is used in Figure in the main text to compare 
the rankings by indegree and PageRank on calibrated ar¬ 
tificial data. In these simulations, we find r(T, 77 ) = 0.71 
and r(T, 77 ) = 0.65 for the RM calibrated on the Digg.com 
and APS dataset, respectively. High values of the corre¬ 
lation between T and 77 confirm that total relevance is a 
suitable estimator of a node’s intrinsic fitness. 

S5: Production of Figure in the main text. To pro¬ 
duce Figure we start from a network with two nodes: 
node 0 and node 1 , and one link between them (from 1 
to 0). The final network consists of A = 40 nodes and 
is grown up according to the RM. Node fitness is drawn 
from the exponentia distribution p{r]) = exp(— 77 ). Node 
relevance decays exponentially with = (0.4)^. At 

each simulation step, a new node is added to the system 
and connected to an existing node according to Eq. 
in the main text. Consequently, one new link is created 
among the existing nodes (so-called internal link). While 
the target node is again chosen according to Eq. the 
starting node is chosen at random from the existing nodes 
which corresponds to constant node activity. 

S6: Assessing the uncertainty of results. Non- 

parametric bootstrap is a statistical method to estimate 
the error on quantities measured in real data [38] . To es¬ 
timate the errors of the correlation coefficients r{k'^^^T) 
and r{p,T) for a real dataset, we create new datasets 
by resampling with repetition from the given dataset. 
Since a resampled dataset can in principle contain multi¬ 
ple links between a pair of nodes, we compute PageRank 
on the resampled data using the generalized formula 




d+i) _ 




it) 


1 — c 
N ’ 


(S9) 


where Mji is the number of directed links from j to 
i (and correspondingly The correla¬ 

tion values of interest can be computed for resampled 
dataset. The standard deviation of these results over 
datasets then characterizes the uncertainty of the origi¬ 
nal correlation values. Eor the Digg.com data, we obtain 
r{¥^,T) = 0.330 ± 0.003 and r{p,T) = 0.224 ± 0.003 
which means that the uncertainty is small and insignifi¬ 
cant in comparison with the absolute differences between 
the correlation values. Results for the APS data lead to 
the same conclusion. 

Eor the results obtained with calibrated simulations, 
we estimate their uncertainty by analyzing several model 
realizations and evaluating the standard error of the 
mean for a quantity of interest. Using 50 model reali¬ 
sations, we find r{k^'^,T) = 0.444 ± 0.002 and r{p,T) = 
0.411 ± 0.002 which has the same implications as before: 
the results’ uncertainties are substantially smaller than 


the absolute difference and thus unsignificant. Results 
for the APS data lead to the same conclusion. 

S7: Relation between outdegree and activity. Eq. 
which governs the creation of outgoing links, does not 
contain the preferential attachment mechanism and as 
a consequence the final outdegree is determined only by 
node activity A^. When activity decay fA is sufficiently 
fast to allow the normalisation factor of to converge, 
the asymptotic solution in the continuum approximation 
[32] reads 


kf^^{t) = 1 Pm Ai 


dt' JaX - n) 

^out 


(SIO) 


where is the time at which node i has entered the 
system and = limt^oo Y.i fA{t - n) < 00 . When 
activity decay is absent, this has an asymptotic solution 


k^'^^{t) = 1 + 777 ^ log 


(Sll) 


where a is the average node activity. The outdegree dis¬ 
tribution is consequently determined mainly by the ac¬ 
tivity distribution p{A). 
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SUPPLEMENTARY FIGURES: ANALYSIS OF REAL DATA 




FIG. SI. Temporal decay of the average relevance r(t) (left panel) and activity a(t) (right panel) in Digg.com 
social network (2006-2008, At = Iweek, color online). Symbols represent the average relevance and activity of nodes 
belonging to the same age group, error bars represent the errors of the mean, lines represent the fits described in Section [rv| 



FIG. S2. Temporal decay of the average relevance r(t) in the APS dataset (1893-2009, At = 91 days, color online). 
Symbols represent the average relevance of nodes belonging to the same age group, error bars represent the error of the mean, 
lines represent the fits described in Section [rv| The initial non-monotonous part of the relavance profile is ignored by the fitting 
procedure and consequently the fitted curves do not match the points corresponding to the first few years after publication. 
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APS dataset 


simulation calibrated on APS dataset 




FIG. S3. Age distribution of the top 1 % nodes in the ranking, in the APS data (left panel) and in the cor¬ 
responding calibrated simulation (right panel) (color online). On the x-axis, nodes are ordered by age (oldest on the 
left, youngest on the right). We see that PageRank is more biased towards old nodes than indegree. Results obtained on the 
calibrated model (see Table for model parameters) agree well with those obtained on the real data. 



FIG. S4. Outdegree distribution in the APS dataset. The distribution is narrow and is well approximated by ~ 

exp(—0.12for > 5 (black line). We use this distribution to generate the number of outgoing links created by a new 
node in the simulations calibrated on the APS dataset. 
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SUPPLEMENTARY FIGURES: NUMERICAL SIMULATIONS 

For the following figures, unless stated otherwise, the models’ settings are: 

• RM: N = 10,000, p{r]) = exp (- 7 ^), /i?(t) = exp (-t/6>i?), /^(t) = exp(-t/6>A). 

• EFM: N = 10,000, H = 250, fR{t) = exp (-t/6>i?), /^(t) = exp(-t/6>A). 



FIG. S5. Linking pattern in the extended fitness model (EFM). Each symbol corresponds to a link from a start node 
whose rank according to fitness is shown on the horizontal axis to a target node whose rank according to fitness is shown on 
the vertical axis. The EFM parameters used to generate the network are N = 10,000, Or = 500, Oa = 10000, H = 250. The 
dotted lines mark the rank position 250 that separates high-fitness nodes (whose fitness 77 is uniformly distributed in the range 
[10“^, 1]) and low-fitness nodes (whose 77 is uniformly distributed in the range [0,10“^)). We see that the EFM model produces 
networks where high-fitness nodes are typically pointed by other high-fitness nodes, thus creating a suitable setting for the 
Page Rank algorithm. 



relevance decay Or relevance decay Or 


FIG. S 6 . Average birth time r of the top 1% of nodes as ranked by PageRank in the RM (left) and in the 
EFM (right), color online. When relevance decay is faster than activity decay (upper-left corners of the plots), PageRank 
is biased towards recent nodes (blue-shaded areas) in both models. When the opposite is true (lower-right corners of the plots), 
PageRank is biased towards old nodes (red-shaded areas). Between the two biased regions, there is a nearly-diagonal contour 
(marked with the dotted line) where the average age of top 100 nodes is N/2 = 5000 which means that the top 100 PageRank 
positions show no bias towards recent or old nodes. We can conclude that PageRank is not biased only when the timescales of 
relevance and activity decay are in accord. Results are averaged over 6 model realizations. 
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relevance decay Or relevance decay Or 


FIG. S7. Indegree-PageRank correlation in the RM (left) and EFM (right). The dotted lines represent the zero-bias 
contours from Figure [S^ When the timescales of relevance and activity decay mismatch (upper-left and lower-right corners of 
the heatmaps), indegree-PageRank correlation is weak due to the time bias of PageRank. This correlation is maximal near the 
contour where PageRank is not biased. 



relevance decay Or relevance decay Or 


FIG. S8. Fitness-PageRank correlation in the RM (left) and EFM (right). The dotted lines represent the zero-bias 


contours from Figure S6 When the timescales of relevance and activity decay mismatch (upper-left and lower-right corners 
of the heatmaps), fit ness-PageRank correlation is weak due to the time bias of PageRank. PageRank performs best along the 
zero-bias contour. Note that while the global maximum of PageRank’s performance for the RM occurs when both Oa and Or 
are small (lower-left corner), the global maximum for the EFM is located in the center area. This happens because in the 
EFM, only a small fraction (5%) of nodes are sensitive to fitness; as a result, when activity and relevance decays are too fast, 
fluctuations damage the capability of indegree and PageRank to efficiently detect fitness. By contrast, in the RM all nodes are 
sensitive to fitness; for this reason. Or = 10 and Oa = 10 are large enough to allow the system to significantly perceive fitness. 
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FIG. S9. Comparison between PageRank and indegree in the RM with uniform fitness distribution (color online). 
Panels show the correlation r{k^'^,p) (left) and the correlation ratio r{p,r]),r]) (right) in the RM with node fitness p 
distributed uniformly in [0,1] (as opposed to the original exponential distribution). The dotted lines represent the PageRank’s 
zero-bias contour for this model (bias is again evaluated on the basis of the average age in top 100 positions of the Pagerank 
ranking). Results show no significant differences from the results on the RM with an exponential fitness distribution (see the 
left panel in Figure S7 and the right panel in Figure]^ in the main text). 
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FIG. SIO. Comparison between PageRank and indegree in the RM (left) and EFM (right) with power-law aging 

(color online). We see here qualitatively the same behaviour as reported in Figures]^ andin the main text where exponential 
aging is assumed. Two regions where PageRank fails are again present: one in which relevance decays faster than activity 
(lower-right corner) and one where the opposite is true (upper-left corner). In both the RM and the EFM, the regions where 
Page Rank is not heavily outperformed by indegree include also the {an, a a) values found in Digg.com data analysis (see Table 
[^. For simplicity, we focus on the parameter values {aR^aA) — (1,0.4), even if values in Table are slightly different and 
dependent node degree group. In the RM, PageRank performance in ranking nodes by fitness is not far from that of indegree 
[(1, 0.4) lies in the white-shaded area of left panel], which explains why indegree and PageRank performances in ranking nodes 
by relevance are close to each other in Digg.com dataset (see Fig. I^of the main text). In the EFM, PageRank even outperforms 
indegree for these parameter values [(1,0.4) lies in the blue-shaded area of right panel], which illustrates the main difference 
between the favorable and unfavorable parameter regions for PageRank: when PageRank is biased because of temporal effects, 
it fails both in the RM and in the EFM; by contrast, when PageRank is not biased, then its performance with respect to 
indegree depends on the growth rule of the system and can benefit from a suitable model such as the EFM. 
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FIG. Sll. Comparison between PageRank and indegree in the RM (left) and EFM (right) using precision (color 
online). Precision Px{s) of the ranking by score s is the average number of nodes that are among the top X places of the 
fitness ranking that are at the same time at the top X places of the ranking by score s. The dotted lines represent the zero-bias 
contours from Figure [Sb] As for the score comparison on the basis of correlation with node fitness, PageRank’s performance 
is again optimal along the contour of its zero time bias. In comparison with the correlation-based results, PageRank now lags 
less behind indegree in the region where relevance decays slowly and activity decays quickly (lower-right corner). However, 
both scores perform badly in this region (for the EFM, for example, P 25 o(p) = 0.16 in the lower-right corner as opposed to the 
best-achieved precision P 25 o(p) = 0.69 when Or = 183 and Oa = 263). 
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FIG. S12. Comparison between PageRank and indegree in the RM with accelerated growth (color online). We 
grow networks according to the RM with the same parameters as before except that in simulation step t, m{t) = 20t/A^ existing 
nodes are sequentially chosen and create one link each. In this way, the rate at which links are created in the network grows 
linearly with network size. This aims to represent real systems where the total node activity is not normalized but rather grows 
with the number of existing nodes. At the same time, the resulting number of links in the network is the same as before when 
m = 10 nodes were chosen and created one link each which makes the present model comparable with the original one. Panels 
show the correlation (left) and the correlation ratio r{p,r])/r{k'^^,r]) (right). The behaviour is again qualitatively 

similar to that found for uniform growth (see Figures]^ and S7) except that the lower-right region where PageRank fails is not 
even more pronounced. This further demonstrates the generality of our observations. 
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FIG. S13. Comparison of total relevance with indegree (left) and PageRank (right) in the RM (color online). Total 
relevance outperforms indegree and PageRank p in ranking nodes by fitness 77 for a broad range of model parameter. The 
performance ratios are particularly large for slow decay of relevance: for instance, the maximum value of r(T, r])/r{k'^^ ,r]) is found 
for {0r,0a) = (4832,14) [r(T, ? 7 )/r(A;*’^, 77 ) = 2.83], and the maximum value of r{T,r])/r{p,rj) is found for {0r,0a) = (3359,10) 
[r(T, 77 )/r(p, 77) = 12.49]. These findings show that total relevance is highly informative on node fitness and might motivate the 
study of the ranking by total relevance in real data. 
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FIG. S14. Comparison of total relevance with indegree (left) and PageRank (right) in the EFM (color online). In 
the data produced by the EFM, the parameter region where total relevance T outperforms indegree k'^^ and PageRank p in 
ranking nodes by fitness is smaller with respect to that observed for the RM. The performance ratio r{T,p)/r(k^^ becomes 
large when relevance decay is slow and, as a consequence, indegree is heavily biased towards old nodes; the maximum value of 
r{T,p)/r{k^'^,p) is found for {0r,0a) = (10000,10) [r{T,p)/r{k'^^,p) = 3.77]. The maximum value of r{T,p)/r{p,p) is found 
for {0r,0a) = (10000,10) [r(T, 77 )/r(p, 77 ) = 142.57, with r{T,p) = 0.172 and r{p,p) = 0.001]. On the other hand, there are 
broad regions of parameter values where total relevance is outperformed by indegree and PageRank, which leaves the following 
question open: to what extent this failure of total relevance is due to the details of the model, such as the choice of p{p) and 
the functional form of 11 ^’^? Answering this question goes beyond the scope of this work. 

























